WALT: Turning Website Features into Reusable Tools for LLM Agents
'WALT converts latent website functionality into deterministic callable tools for LLM agents, boosting success rates on VisualWebArena and WebArena while cutting action counts.'
Records found: 5
'WALT converts latent website functionality into deterministic callable tools for LLM agents, boosting success rates on VisualWebArena and WebArena while cutting action counts.'
Salesforce AI releases GTA1, a powerful GUI agent that outperforms OpenAI's CUA by leveraging innovative test-time scaling and reinforcement learning techniques to improve task success and action grounding.
Salesforce AI has introduced CRMArena-Pro, the first enterprise-grade benchmark testing LLM agents across complex multi-turn business tasks including sales, customer service, and confidentiality handling.
Salesforce has released BLIP3-o, an open-source multimodal model that unifies image understanding and generation using CLIP embeddings and Flow Matching, achieving state-of-the-art results.
Salesforce AI introduces SWERank, a novel retrieve-and-rerank framework that delivers precise and scalable software issue localization with significantly reduced costs compared to existing agent-based methods.